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Human colorectal cancer-specific CCAT1-L IncRNA 
regulates long-range chromatin interactions at the MYC 
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Jian-Feng Xiang\ Qing-Fei Yin', Tian Chen', Yang Zhang', Xiao-Ou Zhang^, Zheng Wu', 
Shaofeng Zhang', Hai-Bin Wang\ Junhui Ge\ Xuhua Lu , Li Yang^, Ling-Ling Chen' 

'state Key Laboratory of Molecular Biology, Institute of Biochemistry and Cell Biology, 'Key Laboratory of Computational Biol- 
ogy, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sci- 
ences, 320 Yueyang Road, Shanghai 200031, China; ^Changzheng Hospital, Second Military Medical University, 415 Fengyang 
Road, Shanghai 200003, China 

The human 8q24 gene desert contains multiple enhancers that form tissue-specific long-range chromatin loops 
with the MYC oncogene, but how chromatin looping at the MYC locus is regulated remains poorly understood. Here 
we demonstrate that a long noncoding RNA (IncRNA), CCATl-L, is transcribed specifically in human colorectal 
cancers from a locus 515 kb upstream of MYC. This IncRNA plays a role in MYC transcriptional regulation and 
promotes long-range chromatin looping. Importantly, the CCATl-L locus is located within a strong super-enhancer 
and is spatially close to MYC. Knockdown of CCATl-L reduced long-range interactions between the MYC promoter 
and its enhancers. In addition, CCATl-L interacts with CTCF and modulates chromatin conformation at these loop 
regions. These results reveal an important role of a previously unannotated IncRNA in gene regulation at the MYC 
locus. 
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Introduction 

The increased sensitivity of experimental assays has 
revealed that long noncoding RNAs (IncRNAs) impact a 
variety of important biological processes (reviewed in [1, 
2]). Aberrant expression of IncRNAs has been linked to 
cancers with distinct modes of action (reviewed in [3]). 
For example, HOTAIR is highly expressed in breast tu- 
mors and has been reported to promote cancer metastasis 
by targeting chromatin repressor Polycomb proteins to 
specific genomic loci [4]. LincRNA-p21 in association 
with hnRNP-K serves as a repressor in p53 -dependent 
transcriptional responses [5] or suppresses target mRNA 
translation in coordination with the RNA-binding protein 
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HuR [6]. MALATl has been implicated in the regulation 
of cell growth and tumor metastasis [7, 8]. These findings 
suggest that IncRNAs may serve as important regulators 
in tumorigenesis although the expression regulation of 
IncRNAs in specific human tumors and their mechanisms 
involved in tumorigenesis remain to be explored. 

Enhancers are a class of DNA regulatory sequences 
that can activate gene expression independent of their 
proximity or orientation to their target genes [9]. En- 
hancers often form long-range chromatin loops with their 
target genes to control temporal- and tissue-specific gene 
expression during development and their mis-regulation 
contributes to human diseases [10]. A large portion of en- 
hancers can be transcribed into enhancer RNAs (eRNAs) 
[11], which have been proposed to contribute to gene 
activation [12-16]. In addition, super-enhancers were 
recently identified and shown to consist of large clusters 
of transcriptional enhancers formed by binding of master 
transcription factors/mediators and to be associated with 
genes that control and define cell identity [17, 18]. Thus, 
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it would be interesting to know whether super-enhancers 
are transcribed and whether they are regulated by RNA 
transcripts. 

The expression of the human MYC oncogene is com- 
plex and is regulated at multiple levels, including en- 
hancers, promoters, transcription factors and chromatin 
state [17, 19-21]. The human 8q24 region includes a gene 
desert containing enhancers forming chromatin loops 
with the MYC promoter located several hundred kilobas- 
es telomerically. These chromatin interactions are tissue- 
specific in prostate, breast and colorectal cancers [19]. 
In colorectal cancer (CRC), one well-characterized loop 
is between a locus 335 kb upstream of MYC (M7C-335) 
and the MYC promoter. MYC-335, the site harboring an 
important CRC risk SNP (rs6983267), is a transcriptional 
enhancer that promotes the binding of transcription factor 
4 (TCF4) specifically in CRC [22, 23]. Importantly, mice 
lacking MYC-335 were resistant to intestinal tumors, 
although MYC transcripts were only modestly reduced 
[24]. Very recently, the region upstream of M7C has been 
reported to contain an exceptionally large super-enhancer 
[17] and such a super-enhancer is tumor type specific 
in cancer cells, but not in its healthy counterparts [20]. 
However, how these chromatin loops at the 8q24 MYC 
locus are regulated remains unknown. 

Human 8q24 has recently been reported to express 
several IncRNAs in different human tumors. PRNCRl 
(8q24) [25] binds to the androgen receptor (AR) and is 
involved in the AR-mediated gene activation in pros- 
tate cancers [26]. However, it is not expressed in CRC 
(Supplementary information. Figure SI). Instead, two 
other CRC-specific IncRNAs transcribed from 8q24 were 
recently reported. CCATl (Colorectal Cancer Associated 
Transcript 1) is 2 600 nt in length and is a highly specific 
marker for CRC [27], and its upregulation is evident in 
both pre-malignant conditions and through all disease 
stages in CRC [28]. CCATl, a 340 nt ncRNA transcribed 
from the MYC-335 region, appeared to enhance invasion 
and metastasis through MYC-regulated miRNAs miR- 
17-5p and miR-20a [29]. However, we have been unable 
to detect CCATl in any human CRC tissue samples or 
CRC-derived cell lines examined (Supplementary infor- 
mation. Figure SI). 

Here we report that a novel 5 200 nt CRC-specific 
IncRNA, CCATl-L {CCATl, the Long isoform), is tran- 
scribed from a locus 5 1 5 kb upstream of MYC {MYC-5 1 5), 
a super-enhancer region of M7C [17, 20], and plays a 
role in MYC transcriptional regulation. We demonstrate 
that CCATl-L localizes to its site of transcription and 
functions in the maintenance of chromatin looping be- 
tween the MYC promoter and its enhancers in coordina- 
tion with CTCF. Together, these results reveal a novel 



connection between the chromatin organization regulated 
by a IncRNA and MYC expression in a specific human 
cancer. 

Results 

CCATl-L, a novel CRC-specific IncRNA, is abundantly 
transcribed fi'om a locus 515 kb upstream of MYC on 
8ql4 

By sequencing paired CRC/control mucosa samples 
from a Chinese patient, we have identified a previously 
unreported IncRNA, CCATl-L, that is abundantly and 
specifically expressed in CRC (Supplementary informa- 
tion. Table SI and Figure SI). CCATl-L is transcribed 
from 8q24.21 — 515 kb upstream of the MYC locus 
(M7C-515) (Figure lA). It is 5 200 nt in length, contains 
two exons and is polyadenylated as revealed by oligo 
(dT) selection followed by northern blot (Supplementary 
information. Figure S2A). 3' RACE further confirmed its 
3' end and the splicing between the two exons (Supple- 
mentary information. Figure SIA and data not shown). 
Interestingly, CCATl, a 2 600 nt IncRNA identified using 
Representational Difference Analysis (RDA) and cDNA 
cloning [27], was shown to be highly associated with 
pre-malignant as well as all disease stages in colon can- 
cer tumorigenesis [27, 28]. Importantly, CCATl-L that 
we identified here overlaps with CCATl (Figure lA and 
IB), suggesting a positive correlation of CCATl-L ex- 
pression with colon cancer. For clarity, we refer the 2 600 
nt CCATl as CCATl-S throughout this study. 

We further confirmed extensive transcription of 
CCATl-L in CRC tissue samples from several other 
patients by northern blots and CC4r/ -Z-specific RT- 
qPCR (Figure IC and ID). Consistent with the expres- 
sion pattern of CCATl-S [27], the expression of CCATl- 
L is undetectable or very low in paired mucosa samples 
(Figure IC and ID) and other normal human tissue 
samples (Supplementary information. Figure S2B). The 
same northern blot probe also recognizes CCATl-S, but 
CCATl-S is expressed at a much lower level compared 
to CCATl-L in human CRC patient tissue samples exam- 
ined (Figure IC). In addition, we observed that CCATl - 
L is specifically expressed in cultured CRC-derived cell 
lines, such as HCT116, HT29 and SW48 (Figure IE). 
Finally, sequence conservation analysis revealed that 
CCATl-L is human specific and no ortholog was seen in 
other species examined (data not shown). 

The human 8q24 gene desert region has recently been 
shown to express distinct IncRNAs in different human 
tumors [25, 27, 29] (Supplementary information. Figure 
SIB). However, these IncRNAs were undetectable in the 
sequencing data of paired CRC/control mucosa samples 
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(Supplementary infomiation, Table SI and Figure SIC), 
or could not be detected in the CRC-derived cell lines 
that we examined (Supplementary information, Figure 
SID). Thus, the significance and functional implications 
of these RNAs in CRC remain unclear. 



CCATl-L is a nuclear-retained IncRNA 

As the two CCATl isoforms overlap, we decided to 
characterize them in greater detail. First, both RNAs 
are polyadenylated (Supplementary information. Figure 
S2A), with halflife of about 6-8 h in HT29 cells (data 
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Figure 1 CCAT1-L, a nuclear-retained IncRNA, is specifically expressed in human CRC tissue samples. (A) RNA-seq of 
paired CRC/control mucosa samples from a Chinese patient revealed that a novel IncRNA, CCAT1-L, is transcribed from a 
locus 515 kb upstream of the MYC locus (/WYC-515) on 8q24. Note that we refer the previously annotated 2 600 nt CCAT1 as 
CCAT1-S throughout this study [27]. (B) A schematic view of CCAT1 locus and its adjacent genomic information on 8q24. The 
locus 335 kb upstream of the MYC locus (MYC-335) contains an enhancer and a CRC risk SNP [22, 23]. Red lines denote 
antisense (AS) probes recognizing either both CCAT1 isoforms (#1 ) or only CCAT1-L (#2) in northern blot; arrows denote 
PCR primer sets that recognize either both CCAT1 isoforms (#3) or only CCAT1-L (#4). (C) Northern blot validated CCAT1- 
L expression in human CRC patient samples. (D) The relative expression of CCAT1-L in CRC tissues and paired control 
mucosa samples from the same patients. The primer sets only recognizing CCAT1-L was used (#4 in B). P values from one- 
tailed f-test in the painwise comparison are shown. (E) Northern blot confirmed CCAT1-L expression in human CRC cell lines. 
(F) CCAT1-L is associated with the nuclear insoluble fractions. Total RNAs from HT29 cells were separated into cytoplasmic, 
nuclear soluble, and nuclear insoluble fractions. Bar plots represent relative abundance of RNAs in the nuclear soluble and 
insoluble fractions as measured by RT-qPCR. #3 or #4 described in B were used to detect either both CCAT1 isofroms or 
CCAT1-L only. Error bars represent standard deviation (± SD) in triplicate experiments. (G) CCAT1-L is exclusively nuclear 
retained, while CCAT1-S is cytoplasmically distributed. RNA ISH (green) was performed with Dig-labeled probes (B) either 
recognizing CCAT1-L (top panel) or both isoforms of CCAT1 (bottom panel) in HT29 cells. (H) CCAT1-L accumulates at its 
site of transcription. Double FISH of CCAT1-L (green) and its adjacent DNA region (red). A single Z stack of representative 
images acquired with an Olympus 1X70 DeltaVision Deconvolution System microscope is shown. DAPI is in blue and the 
white scale bar in all images denotes 5 iim. Representative images are shown (G, H). In C and E, 18S and 28S rRNAs were 
used as loading controls. Supportive data are included in Supplementary information. Figures SI and S2. 



not shown). Second, Jfnoclcdown of CCATl-L by an 
optimized pliospliorothioate-modified antisense oligode- 
oxynucleotide (ASO) led to the simultaneous disruption 
of CCATl-S (Supplementary information, Figure S2C), 
suggesting that the short isoform may be derived from 
CCATl-L. Further analysis revealed alternative polyad- 
enylation (APA) sites in CCATl-L at a genomic position 
close to the 3' end of CCATl-S (data not shown). Third, 
we observed that CCATl-S and CCATl-L are localized to 
different subcellular compartments. By nuclear/cytoplas- 
mic RNA fractionation, we found that CCATl-L shows 
an association with chromatin as tight as Xist, a IncRNA 
that regulates chromatin compaction during X-chromo- 
some inactivation [30] (Figure IF). Fractions with both 
CCATl isoforms revealed a less tight association of 
these RNAs with chromatin (Figure IF), suggesting that 
CCATl-S is cytoplasmic. This was further confirmed by 
RNA in situ hybridization (ISH) with probes that either 
detect CCATl-L only or both CCATl isoforms. CCATl- 
L is exclusively located in the nucleus and accumulates 
in striking nuclear foci in CRC cell lines examined 
(Figure IG, top panel, and data not shown). In contrast, 
when a probe recognizing both isoforms of CCATl was 
used in the ISH, we found that fluorescent signals ap- 
peared in both cytoplasm and nucleus (Figure IG, bot- 
tom panel), suggesting that CCATl-S is located in the 
cytoplasm. This is in agreement with the previous report 
that CCATl-S is a cytoplasm-located IncRNA [31]. Fi- 
nally, we found that the nuclear-retained CCATl-L does 
not colocalize with marker proteins of known nuclear 
bodies, including Cajal bodies, paraspeckles, nuclear 
speckles and PML bodies (data not shown). However, 



double DNA/RNA FISH (Figure IH) clearly revealed 
that CCATl-L accumulates at or near its site of transcrip- 
tion, suggesting a possible role of CCATl-L in local gene 
expression or chromatin organization. 

Knockdown of CCATl-L reduces MYC expression 

The in cis accumulation of CCATl-L suggests a pos- 
sible role in local gene expression. As only a few genes 
are expressed from the 8q24 region (Figure lA and 
Supplementary information. Figure SIB), we examined 
the relative expression of their steady-state mRNAs after 
knockdown of CCATl-L. While knockdown of CCATl- 
L by ASO had no effect on the expression of FAM84B, 
which is located 667 kb centromeric to CCATl-L, it led 
to modestly reduced expression of MYC, which is lo- 
cated 515 kb telomeric to CCATl-L, as revealed by both 
RT-PCR and northern blot in HT29 cells (Figure 2A and 
2B). In addition, we noticed a reduction of MYC protein 
after CCATl-L ASO treatment (Figure 2B). This reduc- 
tion was also observed at different times after the ASO 
treatment (data not shown) and in another CRC cell line 
HCT116 (Figure 2A). Importantly, we found that knock- 
down of CCATl-L greatly reduced the transcription of 
nascent MYC RNA (Figure 2C), further suggesting that 
CCATl-L regulates MYC expression at the transcriptional 
level. Nevertheless, while the reduction on the steady- 
state MYC mRNA that we observed was modest, it is 
possible that this at least partly reflects the known com- 
plexity of MYC regulation [21]. 

We next expressed CCATl-L from a plasmid expres- 
sion vector to see whether it could raise MYC expression. 
However, we observed that transient overexpression of 
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Figure 2 CCAT1-L regulates MYC expression in cis. (A) Knockdown of CCAT1-L led to modestly reduced expression of MYC 
in both HT29 and HCT116 cells. Top, bars represent ASO and arrows represent primer sets. Bottom, bar plots represent rela- 
tive expression of CCAT1-L, MYC and FAM84B 36 h post the ASO treatment (normalized to actin). (B) Northern blot and 
western blot (WB) revealed the reduced expression of MYC after knockdown of CCAT1-L in HT29 cells. Actin was used as a 
loading control in WB. (C) CCAT1-L regulates MYC expression at the transcriptional level. A crude preparation of nuclei was 
subjected to nuclear run-on assay under the indicated conditions in HT29 cells. Nascent transcription of MYC detected from 
scramble ASO-treated nuclei was defined as one. (D) Overexpression of CCAT1-L in trans in expression vector resulted in 
no apparent activation of MYC. Left, RT-qPCR validated the increased expression of CCAT1-L in the pEGFP-CI vector in 
HCT116 cells. Right, the overexpression of CCAT1-L in HCT116 cells did not lead to increase of MYC expression, as revealed 
by RT-qPCR. (E) Overexpression of CCAT1-L in vectors resulted in aberrant localization in the nucleus. RNA ISH (green) was 
performed with a probe recognizing CCAT1-L (Figure 1B) in HCT116 cells transfected with the CC/A7Y-L-expressing vector. 
Note that the overexpressed CCAT1-L produced from transfection vectors assembled as numerous nuclear dots. Representa- 
tive images are shown. Scale bar, 5 |im. Error bars in A, C and D represent ± SD in triplicate experiments. In A and C, P val- 
ues from one-tailed f-test in the pairwise comparison are shown. 



CCATl-L in trans resulted in no apparent activation of from the expression vector localizes to numerous nuclear 
MYC (Figure 2D) and that the overexpressed CCATl-L sites (Figure 2E) rather than to its endogenous in cis site 
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of accumulation (Figure IH). This lack of effect was not 
surprising, given that regulation of CCATl-L on MYC 
may require an in-cis action. Also, we have previously 
reported that other nuclear-retained IncRNAs, when ex- 
pressed from transfected vectors, did not localize to the 
sites of their genomic counterpart regions [32, 33] or ex- 
ert their roles in cis [33]. 

In cis overexpression of CCATl-L enhances MYC expres- 
sion and promotes tumorigenesis 

Recently developed targeted genome-editing technolo- 
gies using engineered nucleases, such as transcription 
activator-like effector nucleases (TALENs), provide a 
precise way to manipulate chromatin regions of inter- 
est (reviewed in [34]). We therefore applied TALENs to 
achieve in cis overexpression of CCATl-L in HCT116 



cells, which normally express a low level of CCATl-L 
(Figure IE). We inserted a CMV promoter and egfp just 
upstream of the CCATl genomic locus to achieve a sin- 
gle allele insertion to express the fusion egfp-CCATl-L 
RNA in cis (Figure 3A and Supplementary information. 
Figure S3A and S3B, TALEN A). As it has been shown 
that the insertion of a double poly(A) site cassette into 
the genome by zinc finger nucleases (ZNF) led to an effi- 
cient transcriptional stop [35], we inserted such a double 
poly(A) site cassette downstream of egfp to terminate the 
transcription of CCATl-L at the same genomic locus to 
obtain the control cell line (Figure 3A, Supplementary 
information. Figure S3A and S3C, TALEN B). Thus, 
these two TALEN-engineered cell lines offered a conve- 
nient system to allow transcription of either egfp-CCATl- 
L (TALEN A) or egfp (TALEN B) from the same pro- 
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Figure 3 In cis overexpression of CCAT1-L enhances MYC expression and tumorigenesis. (A) A schematic view of the strat- 
egy to in cis express CCAT1-L in HCT116 cells by TALEN. TALEN A, the CCAT1-L In cIs overexpression cell line. A cassette 
of CMV promoter and sequences of puromycin and egfp mRNAs was inserted just upstream of the first exon of CCAT1 by 
TALEN. TALEN B, the control cell line that overexpresses egfp. The same cassette of TALEN A but with two additional poly(A) 
sites to terminate the transcription downstream of egfp was inserted into the same genomic location as that in TALEN A (see 
Supplementary information, Figure S3 for details). Note that the transcription occurred in both egfp-CCAT1 -L- and egfp- 
overexpressing cell lines. (B) Northern blot validated the overexpression of egfp-CCAT1-L (left panel) or egfp (right panel) in 
different TALEN lines by using a probe recognizing egfp shown in A. (C) In cis overexpressed egfp-CCAT1-L (B) was poorly 
translated into EGFP due to its nuclear retention, while the overexpressed egfp was efficiently translated into EGFP. Fluores- 
cence microscopy (top) and WB (bottom) of representative TALEN A or TALEN B lines are shown. (D) Nuclear-retained egfp- 
CCAT1-L exclusively accumulated as a single nuclear dot, as revealed by ISH probes recognizing either CCAT1-L or egfp 
in the representative TALEN A clone. (E) Northern blot validated that egfp-CCAT1-L can be efficiently knocked down by the 
ASO that targets CCAT1-L, assayed with a probe recognizing egfp. (F) RT-qPCR revealed the overexpression of CCAT1- 
L in TALEN A lines, but not in control TALEN B lines (normalized to actin and the non-engineered HCT116 cells). Four lines 
of TALEN A and TALEN B cells were analyzed individually. (G) In cis overexpression of egfp-CCAT1-L enhanced MYC ex- 
pression, as revealed by RT-qPCR (normalized to actin and the non-engineered HCT116 cells). The same lines of TALEN 
A and TALEN B cells in F were analyzed. (H) In cis overexpression of egfp-CCAT1-L increased tumor formation in a mouse 
xenograft model. Xenograft tumors were collected 4 weeks after inoculation of cells. Left, representative xenograft tumors 
generated from a TALEN A and a TALEN B line are shown. Right, comparison was made between the egfp-CCAT1-L in cis 
overexpressing TALEN A lines and the egfp in cis overexpressed TALEN B lines. Note that xenograft tumors raised from indi- 
vidual in cis egfp-CCAT1 -L-overexpress'mg HCT116 cell lines were larger than those raised from control TALEN-engineered 
cell lines. Error bars in F-H represent ± SD in indicated multiple experiments. P values from one-tailed f-test in the painwise 
comparison are shown. 18S and 28S rRNAs were used as loading controls in all northern blots. Supportive data are included 
in Supplementary information. Figures S1-S4. 



moter in cis. However, using similar approaches we have 
been unable to obtain complete knockdown of CCATl-L, 
owing to amplification and hence multiple alleles of this 
locus in CRC lines. 

As expected, with a northern blot probe that recog- 
nizes egfp, we found that egfp-CCATl-L or egfp was 
expressed in the indicated TALEN lines (Figure 3B). 
Importantly, we observed that egfp-CCATl-L was rarely 
exported to the cytoplasm for EGFP translation (Figure 
3C, 3D and Supplementary information. Figure S3D), 
further confirming that it is a nuclear-retained IncRNA. 
In contrast, all control TALEN lines that only express 
egfp resulted in a strong EGFP fluorescence (Figure 3C 
and Supplementary information. Figure S3D). In addi- 
tion, the expression of egfp-CCATl-L was further con- 
firmed by a northern blot probe that recognizes CCATl 
(Supplementary information. Figure S3E), while no over- 
expressed CCATl-L was detected in the eg^-expressing 
control TALEN lines (Supplementary information. Fig- 
ure S3F), suggesting that the transcriptional terminator 
after egfp is efficient. The egfp-CCATl-L TALEN lines 
achieved a 15-30-fold increase in in cis expression of 
CCATl-L, while CCATl-L expression remained low in 
eg^-overexpressing control TALEN lines (Figure 3F and 
Supplementary information. Figure S3E and S3F). Final- 
ly, we found that the overexpressed egfp-CCATl-L was 
not efficiently processed to CCATl-S, as only the long 
isoform of CCATl appeared in all northern blots (Figure 



3B, 3E and Supplementary information. Figure S3E). 
Although these results suggest that the biogenesis of two 
isoforms of CCATl requires further attention, they also 
reveal that the in cis overexpression offers a clear system 
to evaluate the function of CCATl-L in the current study. 

Importantly, egfp-CCATl-L can also be knocked 
down by the same ASO that targets CCATl-L, as re- 
vealed by northern blot with an egfp probe (Figure 3E). 
Furthermore, this ASO treatment also led to a consistent 
reduction of MYC transcripts in several in cis CCATl- 
L-overexpressing HCT116 cell lines (Supplementary 
information. Figure S4A). It is worth noting that the in 
cis activated egfp-CCATl-L RNA does not colocalize 
to known nuclear bodies (Supplementary information. 
Figure S4B), but instead locates as one single nuclear ac- 
cumulation (Figure 3D) to its site of transcription (Figure 
4E). These characteristics of egfp-CCATl-L resemble 
known molecular features of the endogenous CCATl-L 
(Figure IF-IH). 

Compared to the control engineered TALEN B HCTl 16 
cells (Figure 3A), the in cis overexpressed egfp-CCATl- 
L enhanced MYC expression (Figure 3G). Furthermore, 
we also observed that eg/p-CC^J/ -L-overexpressing 
HCTl 16 cells grew faster than the control cell lines un- 
der low serum culture conditions (data not shown). More- 
over, transplantation of eg^-CC4r/ -L-overexpressing 
HCTl 16 cell lines resulted in larger xenograft tumors in 
nude mice when compared to eg^-overexpressing con- 
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Figure 4 The long-range interaction between the MYC promoter and its upstream regulatory elements. (A) The existence of 
multiple chromatin loops in the upstream region of MYC in HT29 cells. Physical map of the region spanning a 550 kb distance 
with CCAT1-L (MYC-515) at one end and MYC at the other, interrogated by 3C. Top, the position of the constant fragment 
containing MYC-335, a known region that is looping with the MYC promoter [22], is marked by a black bar (bait 1); positions 
of Hind\\\ restriction target fragments are marked by pink bars and primers were designed accordingly. Bottom, 3C interaction 
frequencies of the constant fragment with other fragments revealed the increased interaction between MYC-335 and MYC 
promoter and between MYC-335 and MYC-515. 3C products were confirmed by Sanger sequencing (examples were shown 
in Supplementary information. Figure S5B). The relative abundance of each 3C PGR product was determined using ImageJ, 
normalized by each corresponding input signal and the bait PGR product (set as 1.0), and labeled underneath. (B) The chro- 
matin loops in the upstream region of MYC. Top, the position of the constant fragment containing CCAT1-L locus (MYC-515) 
is marked by a black bar (bait 2); see A for details. (C) Double DNA FISH of CCAT1-L (green) and MYC (red) genomic loci in 
HT29 cells. FISH probes are labeled as black bars in A. White arrows indicate the co-localized loci from a representative cell. 
(D) The majority of CCAT1-L and MYC genomic regions are spatially close. Left, each HT29 cell contains multiple CCAT1- 
L and MYC loci, and the number of each locus per cell was calculated from totally 102 cells counted. Right, the majority of 
CCAT1-L loci in HT29 cells co-localize with MYC loci. (E) CCAT1-L RNA accumulates to chromatin regions at or near the 
MYC locus. Double FISH of egfp-CCAT1-L (green) with CCAT1-L DNA region, MYC locus and MYC-335 region revealed the 
co-localization of egfp-CCAT1 -L with these loci, but not with 15q11-13. Position-specific 10-15 kb probes (shown in A) or a 
probe recognizing15q11-13 [32] were used in DNA FISH. White arrows indicate the single-allele overexpressing egfp-CCAT1- 
L or its co-localized DNA regions in representative cells. (F) A schematic drawing of chromatin loops at the MYC locus. Loop 
1 (pink line) is between the MYC promoter (red box) and MYC-335 (brown box); loop 2 (blue line) is between MYC-335 and 
MYC-515 (green box); the spatially close localization of loop 1 and loop 2 resulted in the chromatin looping between the MYC 
promoter and MYC-515, which is "loop 3". In C and E, a single Z stack of representative images acquired with an Olympus 
1X70 DeltaVision Deconvolution System microscope is shown. Supportive data are included in Supplementary information. 
Figure S5. 



trol cell lines (Figure 3H). Taken together, we conclude 
that CCATl-L plays a role in tumorigenesis by positively 
regulating MYC expression in cis. 

The genomic locus encoding CCATl-L is spatially close 
to the MYC locus 

How does CCATl-L regulate MFC transcription across 
a distance of 5 1 5 kb on 8q24? It is known that chromatin 
looping can juxtapose genes or enhancers to spatially 
distant regions [36]. For example, a well-characterized 
chromatin loop between a locus 335 kb upstream (MYC- 
335) and the MYC promoter in CRC was reported, and 
M7C-335 was proposed to act as a transcriptional en- 
hancer for MYC [22, 23]. We reasoned that the CCATl- 
L-transcribing locus M7C-515 may also locate close to 
MYC by forming a chromatin loop. By using Chromo- 
some Conformation Capture (3C) to measure the chro- 
matin interaction frequency of a constant fragment at 
M7C-335 with a number of target fragments between the 
MYC promoter and M7C-515, we found that the highest 
interaction frequencies were between M7C-335 and the 
MYC promoter, and between M7C-335 and M7C-515 
(Figure 4A). The results were confirmed when a constant 
fragment was set at M7C-5 1 5 (Figure 4B) or in the MYC 
promoter (Supplementary information. Figure S5A) in 
3C experiments. These results clearly suggest that the ge- 
nomic locus encoding CCATl-L (M7C-515) is spatially 
close to MYC and M7C-335 in addition to the known 
chromatin loop between the MYC promoter and MYC- 



335 (Figure 4B). 

To further confirm our 3C results, we visualized the 
localization of the MYC promoter and M7C-515 by dou- 
ble DNA FISH in single HT29 cells. We designed two 
short probes (-10 kb in length) that recognize either the 
MYC promoter or M7C-515 to achieve a higher resolu- 
tion for double DNA FISH. The 8q24 region is amplified 
to at least 6 copies in HT29 cells [37, 38]. We found that 
on average each probe can detect 6 copies of the CCATl 
locus and 8 copies of the MYC locus in HT29 cells (Figure 
4C and 4D). Importantly, 95% of the detected CCATl 
loci colocalize with M7C loci (Figure 4D). 

Although CRC -derived cancer cell lines usually con- 
tain multiple chromatin copies of 8q24 (Figure 4C and 
4D), the in cis activated egfp-CCATl-L HCT116 cell line 
provided a clearer system that allowed us to visualize the 
localization of CCATl-L with its adjacent chromatin. In 
agreement with the spatially close localization of MYC- 
515, M7C-315 and the MYC locus, we found that these 
three loci, although separated by 515 kb in distance on 
8q24, are associated with CCATl-L RNA as revealed 
by the fact that in cis activated egfp-CCATl-L exhibited 
the strong colocalization with all of these loci (Figure 
4E, and a schematic view is shown in Figure 4F). Mean- 
while, a control DNA FISH that recognizes 15qll-13 
region [32] showed no co-localization with the in cis 
activated egfp-CCATl-L. Taken together, the chromatin 
organization at the MYC locus and the unique localiza- 
tion pattern of CCATl-L strongly support the notion that 
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Figure 5 CCAT1-L is required to maintain the cfiromatin looping at tlie MYC locus. (A) The chromatin region between MYC- 
515 and MYC-335 exhibits strong characteristics of a super-enhancer in HCT116 cells but not in HI cells (histone modifica- 
tions data were retrieved from ENCODE collection [39]). CCAT1-L, MYC-335 and MYC loci are highlighted in red. (B) Dis- 
tribution of H3K27ac signal across enhancers (outer figure) and super-enhancers (inner figure) in HCT116 cells. Rank and 
H3K27ac signal of enhancers and super-enhancers were downloaded from the literature [20]. 387 super-enhancers (black 
points) were identified from uneven distribution of H3K27ac signal among normal enhancers (grey points), and the CCAT1- 
L-associated super-enhancer (red point) is ranked as #12 super-enhancers with high H3K27ac signals. (C, D) Knockdown of 
CCAT1-L reduced the chromatin looping at the MYC locus. The long-range interaction frequencies between three chromatin 
regions {MYC-335IMYC, MYC-335/ MYC-5^ 5, and MYCIMYC-5^5) were reduced after knockdown of CCAT1-L as revealed 
by 3C assays in HT29 cells. Over 90% of CCAT1-L was depleted after the ASO treatment in 30 assays (data not shown). (E) 
Knockdown of CCAT1-L has no effect on the chromatin looping at the (3-globin locus. The same Hind\\\ restriction fragments 
were designed for 30 primers and PCRs were performed at the same time as C and D. (F) Knockdown of CCAT1-L reduced 
the chromatin looping at the MYC locus in H0T116 cell line with CCAT1-L in cis overexpression. The long-range interaction 
frequencies between the chromatin regions examined in B and C were also reduced after knockdown of CCAT1-L in the 
H0T116 cell line as revealed by 30 assays. In C-F, the relative abundance of each 30 POR product was determined using 
ImageJ and labeled underneath. 30 experiments were repeated three times. Supportive data are included in Supplementary 
information, Figures S5 and S6. 



CCATl-L can regulate MYC expression in cis. 

The genomic locus encoding CCATl-L is a strong super- 
enhancer 

One way to activate gene expression across large 
distances is by enhancer DNA elements that form long- 
range chromatin loops with their target genes [9]. Recent 
genome-wide analyses of chromatin markers in different 
human cell lines [39] have provided a rich resource to 
allow us to analyze the chromatin modifications in the 
M7C-515 region. Genome-wide mapping of enhanc- 
ers in HCT116 cells by searching for locations with 
high H3K27 acetylation (H3K27ac), high H3K4 mono- 
methylation (H3K4mel) but low H3K4 tri-methylation 
(H3K4me3) [40] revealed that the genomic locus encod- 
ing CCATl-L is an enhancer that spans up to 150 kb in 
length (Figure 5A). The size of this enhancer is distinct 
from that of a typical enhancer which is ~ 1.5 kb in length 
on average [17]. Analyzing the distribution of H3K27ac 
signal across enhancers further revealed that this region 
is a strong super- enhancer in HCT116 cells [20] (Figure 
5B). Also consistent with a role as a super-enhancer, 
this region is enriched for CBP/P300 binding sites and 
hypersensitive to DNase I but is devoid of H3K27me3 
modification (Figure 5A and data not shown). Moreover, 
in agreement with the highly cell- or tissue-specific 
feature of enhancers [9], we found that this enhancer is 
associated with enhancer-specific histone modifications 
in CRC cell lines, but not in other cell lines, such as the 
human embryonic stem cell HI line (Figure 5A). This is 
consistent with the very recent report that the gene desert 
surrounding MYC contains tumor type-specific super- 
enhancers in cancer cells, but not in their healthy coun- 
terparts [20]. 



CCATl-L is required for the maintenance of chromatin 
looping at the MYC locus 

Recent studies have suggested that a large fraction of 
enhancers can be bidirectionally transcribed into eRNAs 
[11] which have been proposed to contribute to gene acti- 
vation at a distance [12-16], presumably by the establish- 
ment or maintenance of enhancer-promoter looping. We 
asked whether the CRC-specific super-enhancer region- 
transcribed CCATl-L plays a role in the maintenance of 
chromatin looping between the MYC promoter and its 
enhancers. 

We measured the relative chromatin interaction fre- 
quency at the known interaction regions on 8q24 (Figure 
4) by 3C experiments before and after knockdown of 
CCATl-L. Strikingly, we observed that the interaction 
frequencies between M7C-335 and the MYC promoter 
and between M7C-335 and M7C-515 were significantly 
reduced after CCATl-L knockdown (Figure 5C). The 
interaction frequency between M7C-5 1 5 and the MYC 
promoter was also greatly reduced under the same treat- 
ment (Figure 5D). In contrast, another known loop at the 
P-globin locus was not altered (Figure 5E). Moreover, 
knockdown of CCATl-L in the CCATl-L-in cis overex- 
pressing cell line also reduced the interaction frequencies 
between the looping regions upstream of MYC (Figure 
5F). Importantly, these observations were further sup- 
ported by 3C-Seq [41] with bait fragment recognizing 
the CCATl-L locus {MYC-515) in control and CCATl-L- 
depleted cells (Supplementary information. Figure S6). 
Many long-range chromatin interactions were detected 
within the 500-kb region between the CCATl-L and MYC 
loci (Supplementary information. Figure S6), which was 
consistent with available 5C datasets generated by the 
University of Washington ENCODE GROUP (http:// 
genome .ucsc . edu/cgi-bin/hgTrackUi?db=hg 1 9&g=wg 
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Figure 6 CCAT1-L interacts with CTCF and modulates CTCF binding to chromatin. (A) ChlP-seq revealed that TCF4 and 
CTCF are enriched at 8q24 in HCT116 cells (ChlP-seq data were retrieved from ENCODE collection [39]). (B) CTCF is re- 
quired for chromatin looping at 8q24. Left, knockdown of CTCF was achieved using shRNA against CTCF as confirmed by 
WB. Right, the long-range interaction frequencies between three chromatin regions {MYC-335IMYC, MYC-335IMYC-5^5, and 
MYCIMYC-5^5, primers used in Figure 4) were reduced after knockdown of CTCF as revealed by 3C assays in HT29 cells. 
The relative abundance of each 3C PCR product was determined using ImageJ and labeled underneath. 3C experiments 
were repeated for three times. (C) CTCF is required for MYC and CCAT1-L expression. The relative abundance of MYC and 
CCAT1-L was analyzed by RT-qPCR in control and CTCF-knockdown HT29 cells. (D) CCAT1-L and CTCF interact in vitro. 
Top, a schematic view of four overlapping CCAT1-L RNA fragments for IVT. Bottom, biotin-labeled RNA pull-down assay 
using different fragments of CCAT1-L transcript in HT29 nuclear extracts showed that one fragment of CCAT1-L binding to 
CTCF. No CCAT1-L fragment was specifically associated with TCF4. (E) Interaction between endogenous CCAT1-L and 
CTCF was confirmed by RNA immunoprecipitation (RIP). RIP was performed with HT29 cells after UV crosslinking by using 
anti-CTCF, anti-TCF4 and anti-IgG, followed by RT-qPCR. Bar plots represent fold enrichments of RNAs immunoprecipitated 
by each indicated antibody over anti-IgG. SF?A, steroid receptor RNA activator [54]; sno-lnc5AC, H/ACA sno-lncRNA [32]; ci- 
ankrd52, a circular intronic RNA [33], (F) CCAT1-L modulates CTCF binding to chromatin. Knockdown of CCAT1-L reduced 
the interaction of CTCF to its occupied sites in chromatin. ChIP with anti-CTCF in scramble- and CCAT1-L ASO-treated HT29 
cells. Data were expressed as the percentage of CTCF co-precipitating DNAs in MYC promoter, MYC-335, /WYC-515 regions 
and negative CTCF binding sites on 8q24, versus input under each indicated condition (left). Control CTCF ChlPs were per- 
formed on positive and other negative CTCF binding sites (right). P values from one-tailed f-test in the painwise comparison 
are shown (*P < 0.05, **P < 0.01). In E and F, error bars represent ± SD in triplicate experiments. Supportive data are includ- 
ed in Supplementary information, Figure S6. 



EncodeUwSC). Importantly, loops between MYC and 
MYC-335, and between M7C-335 and MYC-515 were 
most significantly affected by knockdown of CCATl-L 
(Supplementary information, Figure S6). Together, these 
results suggest that the super-enhancer region-transcribed 
CCATl-L is required for the maintenance of certain chro- 
matin loops at the MYC locus in CRC cancer cells. We 
observed the same phenomenon when the same chroma- 
tin loops between the MYC promoter and its enhancers 
were assayed using different sets of "bait" and "fragment" 
PCR primer sets (data not shown) and in two additional 
biological repeats (data not shown). In all assays, 3C 
PCR products were Sanger sequenced to confirm that 
they mapped to the designed separated chromatin regions 
with a HincRll cutting site (Supplementary information. 
Figure S5B). 

CTCF plays a role in mediating long-range chromatin 
interactions between the MYC promoter and its enhanc- 
ers 

Analyses of the available ChlP-seq datasets [39] re- 
vealed that TCF4 and CTCF are highly enriched in this 
8q24 region in HCT116 cells (Figure 6A). Extensive 
TCF4 binding in the chromatin region between MYC- 
515 and M7C-335 was consistent with the notion that 
this chromatin region is a super-enhancer that can recruit 
transcriptional factors (Figure 5A). Expression of a dom- 
inant-negative TCF4 in HT29 cells resulted in reduced 
expression of both MYC and CCATl-L (Supplementary 
infonnation. Figure S7A), confirming a role of TCF4 in 
both MYC [42] and CCATl-L regulation, and suggesting 



that the transcription of CCATl-L is responsive to TCF4 
signaling in CRC. 

In addition, the specific enrichment of CTCF at sites 
for chromatin looping formation in the MYC promoter, 
M7C-335 and M7C-515 regions (Figure 6A) suggests 
that these chromatin loops observed at the MYC locus are 
CTCF-mediated. It is in agreement with a critical role 
of CTCF in mediating chromatin looping (rather than 
only functioning as an "insulator") [43-45]. This is what 
we have observed for the 8q24 region. Knockdown of 
CTCF disrupted these chromatin loops, as revealed by 
reduced chromatin interaction frequencies between the 
MYC promoter and its enhancers (Figure 6B). Impor- 
tantly, knockdown of CTCF led to reduced expression 
of M7C and CCATl-L (Figure 6C and Supplementary 
information. Figure STB), supporting the notion that 
CTCF and CTCF-mediated chromatin looping are re- 
quired to maintain proper transcription of both MYC and 
CCATl-L. This also indicates the possibility that CTCF 
and CCATl-L may participate in a positive regulatory 
network in control of MYC transcription by regulating 
the higher chromatin organization of 8q24 surrounding 
theM7Clocus. 

CCATl-L interacts with CTCF and modulates CTCF 
binding to chromatin at the MYC locus 

As CCATl-L is required for the maintenance of chro- 
matin looping at the MYC locus (Figure 5 and Supple- 
mentary information. Figure S6), we set up analyses to 
understand the underlying mechanisms by searching for 
CC4r/-Z-interacting proteins using biotin-labeled RNA 
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pull-down assays. TCF4 and CTCF are enriched at MYC- 
5 1 5 (Figure 6A) and it is known that many DNA-binding 
proteins can also bind to RNAs [46, 47]. As a first step to 
examine CC4r/ -L-associated proteins, we asked wheth- 
er CCATl-L is associated with TCF4 and CTCF. 

CCATl-L is 5 200 nt in length, which is too long to 
achieve an efficient biotin-labeled in vitro transcription 
(IVT) with a proper folding in denatured buffer (data 
not shown). Therefore, we generated several sense and 
antisense biotin-labeled IVT fragments that span the 
fiiU sequence of CCATl-L, each overlapping another by 
100 nt at both ends (Figure 6D). After incubation with 
nuclear extracts isolated from HT29 cells with individual 
biotin-labeled RNA fragments, we found that only the 
sense biotin-labeled 2 655-3 959 fragment of CCATl- 
L, which does not overlap with CCATl-S, specifically 
interacted with CTCF (Figure 6D). In contrast, no frag- 
ment could specifically bring down TCF4 or another 
abundant nuclear protein p54'"'' (Figure 6D and data not 
shown), confirming the specificity of this pull-down as- 
say. Furthermore, we confirmed the specific interaction 
between CCATl-L and CTCF by RNA immunoprecipita- 
tion (Figure 6E). We found that CCATl-L was efficiently 
co-precipitated with antisera directed against CTCF, but 
much less efficiently with those targeting TCF4 (Figure 
6E). Other abundant ncRNAs, H/ACA sno-lncRNA [32] 
or ciRNA [33] , could not be pulled down by these anti- 
bodies, demonstrating the specificity of the RNA precipi- 
tation assays. Together, these analyses strongly indicate 
that CCATl-L specifically interacts with CTCF. 

We finally asked whether the interaction of CCATl-L 
and CTCF contributes to the observed role of CCATl-L 
in the maintenance of the long-range chromatin interac- 
tions between the MYC promoter and its enhancers (Fig- 
ures 4, 5 and Supplementary information. Figure S6). 
Knockdown of CCATl-L led to a modest reduction of 
CTCF binding to chromatin at their occupied chromatin 
sites in loop-forming regions at the MYC locus (Figure 
6F). This suggests that CCATl-L IncRNA may act to lo- 
cally concentrate CTCF or allosterically modify CTCF 
binding to chromatin to maintain the chromatin looping 
in the 8q24 region surrounding the MYC locus in CRC 
cancers. We have consistently observed only modest ef- 
fects on CTCF binding to chromatin after knockdown of 
CCTAl-L, which is in agreement with the recent report 
that CTCF-occupied chromatin sites often anchor consti- 
tutive chromatin interactions [44]. 

Discussion 

Recent studies reported that many enhancers can bidi- 
rectionally express non-polyadenylated noncoding RNAs 



(eRNAs) with very low copy numbers [11, 48, 49]. Such 
eRNAs have been proposed to play an enhancer-like role 
in transcriptional activation at a distance, presumably by 
the establishment or maintenance of enhancer-promoter 
looping [12-16]. In addition, AR-associated IncRNAs 
PRNCRl and PCGEMl were shown to be involved in 
the regulation of AR-dependent gene activation events 
by a sequential recruitment of PRNCRl and PCGEMl 
to AR and the recognition of H3K4me3 marks by PC- 
G^'M/ -recruited PYG02 to enhance interactions of AR- 
bound enhancers with target gene promoters genome- 
widely [26]. In the current study, we found that the locus 
515 kb upstream of MYC is a super-enhancer (Figure 5A 
and 5B), which is about 150 kb in length and forms chro- 
matin loops with MYC (Figures 4, 5, Supplementary in- 
formation. Figures S5 and S6). Interestingly, it has been 
very recently highlighted as a tumor type-specific super- 
enhancer [20]. Our results showed that this MYC locus- 
related super-enhancer (Figure 5A) expresses a human 
colorectal cancer-specific IncRNA CCATl-L (Figure 1). 
This IncRNA is polyadenylated and is specifically accu- 
mulates in cis in the nucleus (Figure 1). It positively reg- 
ulates MYC transcription (Figures 2 and 3) by promoting 
chromatin interactions between MYC and its upstream 
regulatory elements (Figures 4, 5 and Supplementary 
information. Figure S6). Different from the reported eR- 
NAs or PCGEMl that are associated with components of 
Mediators, cohesin or chromatin modification enzymes 
[12-14], or directly open the chromatin accessibility [15], 
our data suggest that CCATl-L interacts with CTCF and 
modulates CTCF binding to chromatin to maintain the 
looping at the MYC locus (Figure 6). 

CTCF has been proposed to act as one of the master 
candidates as a global genome organizer to coordinate 
chromatin structures and regulate gene expression [50]. 
Recent analyses of CTCF-associated higher-order chro- 
matin structures by ChlA-PET [43] and the long-range 
interaction landscape of gene promoters by 5C [44, 45] 
have revealed a primary role of CTCF in mediating chro- 
matin looping, rather than only functioning as an insula- 
tor [51]. CTCF is particularly involved in the formation 
of the intermediate-length chromatin loops at scales of 
100 kb-1 Mb in a more constitutive way [44]. Consistent 
with this notion, we showed here that CTCF plays an im- 
portant role in MYC expression and in mediating 300-550 
kb long-range chromatin interactions between enhancers 
and the MYC promoter (Figure 6A-6C). However, im- 
portantly, we also observed that the association of CTCF 
with its occupied chromatin sites was influenced by ad- 
ditional context in cancers, such as the presence of the 
IncRNA CCATl-L in CRC (Figure 6). CTCF has been 
shown to bind to RNAs [52-54]. Depletion of RNAs ei- 
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ther affected its binding to cohesin [54] or altered CTCF 
binding to chromatin [52]. We found that knockdown of 
CCATl-L led to the modest reduction of CTCF binding 
to chromatin (Figure 6F). This observation not only fur- 
ther confirmed that the CTCF-occupied sites at the MYC 
locus are constitutive, but also was consistent with the 
notion that these chromatin loops at the MYC locus are 
under regulation [19, 20]. However, we do not yet know 
how precisely CTCF coordinates with CCATl-L to regu- 
late the chromatin looping at the MYC locus. Also, it is 
possible that CCATl-L may interact with other chromatin 
organizers or modifiers in addition to CTCF. 

MYC regulation is extremely complex. The physi- 
ologic transcriptional control of this gene still is not fully 
understood due to the lack of a complete annotation of 
regulatory elements across different tissue types [21]. 
Our study presented here, as well as others [19, 20], has 
demonstrated that the megabase-sized region of gene 
desert around MYC contains many regulatory elements 
(enhancers and super-enhancers) that form looping in- 
teractions with MYC in a tissue-Ztumor type-specific 
manner. Thus, the proper chromatin organization of 8q24 
gene desert region can be the key to precisely regulate 
MYC transcription under different physiologic condi- 
tions. In agreement with this notion, MYC transcripts 
in mice lacking M7C-335 were modestly reduced [24]. 
Knockdown of CTCF led to the reduced interactions 
between the MYC promoter and its enhancers (Figure 
6B) and a decreased MYC transcription as well (Figure 
6C). In addition, knockdown of CCATl-L RNA reduced 
the enhancer-promoter interactions at the MYC locus 
(Figures 5 and Supplementary information. Figure S6) 
and MYC transcription (Figure 2A-2C). The in cis over- 
expression of this IncRNA promoted MYC transcription 
and enhanced tumorigenesis (Figure 3 and Supplemen- 
tary information. Figure S4), presumably by modulating 
enhancer-promoter interactions (Figure 5 and Supple- 
mentary information. Figure S6). Thus, we propose that 
the expression of CCATl-L RNA may influence the chro- 
matin organization of 8q24 at the MYC locus, contribut- 
ing in part to the aberrant expression of MYC in human 
colorectal cancer pathogenesis. However, as it has been 
reported that transcriptional recruitment at certain locus 
may affect the expression of adjacent genes [55], we do 
not exclude the possibility that the acquirement of the 
transcription activity in the CCATl-L locus during CRC 
pathogenesis may also contribute to the regulation of 
MYC transcription. 

Although we do not know to what extent CCATl-L- 
regulated looping cross-talks with other aspects of MYC 
regulation, this study represents yet another component 
of the complicated MYC region. Finally, as this region 



also expresses distinct IncRNAs in other types of human 
cancers, it will be of interest to learn whether other 8q24 
IncRNAs behave in a similar way in MYC transcription 
in different cancers. 

Materials and Methods 

Cell culture and ASO treatment 

Human cell lines were cultured using standard protocols pro- 
vided by ATCC. Phosphorothioate-modified oligodeoxynucleotide 
(ASO) were synthesized at BioSune, Shanghai, China. The ASO 
was introduced into HT29 or HCT116 cells by nucleofection (Lon- 
za) according to the manufacturer's protocol. The scramble- and 
ASO-treated cells were collected for the following experiments. 

Total RNA isolation, human tissue RNA samples, RT-PCR, 
RT-qPCR and northern blot 

Total RNAs from cultured cells with different treatments were 
extracted with Trizol Reagent (Invitrogen). Twenty human tissue 
RNA samples were purchased from Ambion. For RT-PCR, after 
treatment with DNase I (Ambion, DNA-free TM kit), the cDNA 
was transcribed with Superscript II (Invitrogen), followed by 
PCR. For qPCR, the relative expression of genes was quantified 
to actin mRNA from three independent experiments. Primers for 
PCR and qPCR are listed in Supplementary information, Table 
S2. For northern blot, equal amounts of total RNAs collected fi^om 
cultured cells and primary tissue samples from patients were re- 
solved on 1 .5% agarose gels and northern blot was carried out as 
described previously [33]. Digoxigenin-labeled antisense CCATl- 
L probe and egfp probe were made using either SP6 or T7 RNA 
polymerases by IVT with the DIG Northern Starter Kit (Roche). 
Primers for northern blot probes are listed in Supplementary infor- 
mation. Table S2. 

Patient samples and RNA extractions from tissues and deep 
sequencing for gene expression analysis 

Human paired CRC/control mucosa samples were obtained 
from Changzheng Hospital, Shanghai under the strict guidance of 
ethical committee. After frozen tissue samples were powdered in 
liquid nitrogen, Trizol was added to extract RNA. RNA quality 
was examined by gel electrophoresis and only paired RNA with 
high quality was used for following analyses, including RNA-seq. 
RNA-seq libraries were prepared according to the manufacturer's 
instructions and then applied to sequencing on Illumina HiSeq 
2000 in CAS-MPG Partner Institute for Computational Biology 
Omics Core, Shanghai and Genergy, Shanghai. In all, 52 and 59 
million 1 x 100 single reads of the paired CRC/control mucosa 
RNA samples were obtained and were uniquely mapped to the 
hgl9 genome with over 85% of mapping rates for both cases. The 
gene expression analysis was carried out as described previously 
[32] and genes (mRNAs and IncRNAs) altered significantly in 
both samples are listed in Supplementary information. Table SI. 

Nuclear/cytoplasmic RNA fractionation, nuclear soluble/ 
insoluble RNA fractionation and polyadenylated/non-poly- 
adenylated RNA separation 

Nuclear and cytoplasmic RNA isolation in HT29 cells, nuclear 
soluble and insoluble RNA fractionation were performed as de- 



www.cell-research.com | Cell Research 



<^ CCAT1-L IncRNA regulates MYC by chromatin looping 



scribed before [32]. Polyadenylated and non-polyadenylated RNA 
separation in HT29 cells was carried out as described previously 
in HeLa cells [56]. Semi-quantitative RT-PCR was then used to 
evaluate the relative abundance of both CCATls, XIST and actin in 
each sample. 

RNA ISH and immunofluorescence microscopy 

To detect CCATl-L and CCATl-S, RNA ISH was carried out as 
previously described with in vitro transcribed digoxigenin-labeled 
antisense probes [32]. For colocalization studies, cells were co- 
stained with rabbit anti-Nucleolin (Santa Cruz Biotechnology), 
mouse anti-p54"' (BD) and mouse anti-Coilin (Sigma). The nuclei 
were counterstained with DAPI. Images were taken with a Zeiss 
LSM 510 microscope or with an Olympus 1X70 Delta Vision RT 
Deconvolution System microscope. 

RNA/DNA double FISH 

Sequential RNA/DNA double FISH experiments were carried 
out as described previously [32]. After RNA ISH, cells were dena- 
tured at 80 °C for 5 min in prewarmed 2x SSC and 70% deionized 
fomamide, pH 7.0. Next, cells were hybridized with denatured 
labeled DNA probe prepared by Nick Translation (Empire Genom- 
ics) overnight. After hybridization, two washes of 10 min at 37 °C 
with 2x SSC/50% deionized formamide, pH 7.0, followed by two 
washes of 15 min at 37 °C with Ix SSC and two washes of 15 min 
at 37 °C with 2x SSC were performed. Slides were then mounted 
with ProLong Gold antifade reagent with DAPI. Analyses were 
performed on single Z stacks acquired with an Olympus 1X70 
Delta Vision RT Deconvolution System microscope. Colocalization 
signals were detected in > 95% double-positive cells. 

Double DNA FISH and statistical analyses 

Double DNA FISH experiments were carried out as described 
above with minor modifications. Position-specific 10-15 kb probes 
were amplified by long-range PCR directly from genomic DNAs 
with primers listed in Supplementary infomiation. Table S2. These 
DNA FISH probes that target the specific looping interactions of 
8q24 were labeled with Alexa Fluor 488 (green) or 594 (red) by 
Nick Translation (Empire Genomics). After fixation and permea- 
bilization, cells were denatured at 80 °C for 5 min in prewarmed 
2x SSC and 70% deionized formamide pH 7.0. Next, cells were 
hybridized overnight at 37 °C with denatured labeled DNA probes. 
After hybridization, two washes of 10 min at 37 °C with 2x 
SSC/50% deionized fomamide, pH 7.0, followed by two washes 
of 15 min at 37 °C with Ix SSC and two washes of 15 min at 37 
°C with 2x SSC were performed. Slides were then mounted with 
ProLong Gold antifade reagent with DAPI. Pictures were taken 
with an Olympus 1X70 Delta Vision RT Deconvolution System mi- 
croscope. Colocalization signals were analyzed in double-positive 
cells. 

UV crosslinking RNA immunoprecipitation 

UV crosslinking RIP was carried out as described before [33]. 
Two 10 cm" dishes of HT29 cells with 90%- 100% confluence were 
washed twice with 5 ml cold PBS and irradiated at 300 mj/cm at 
254 nm in a Stratalinker Cells were collected and resuspended in 
1 ml RIP buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 1% NP- 
40, 0.5% sodium deoxycholate, 1 mM PMSF, 2 mM VRC, prote- 
ase inhibitor cocktail). Then cells were homogenized by 3 rounds 



of sonication on ice. Insoluble material was removed by centrifu- 
gation and the supernatant was pre-cleared with Dynabeads G 
(Invitrogen) with 20 )ig/ml yeast tRNA at 4 °C for 30 min. The 
pre-cleared lysate was incubated with Dynabeads G that were pre- 
coated with 2 )ig antibodies of anti-CTCF (Millipore), anti-TCF4 
(Millipore) or IgG (Sigma) for 4 h at 4 °C. The beads were washed 
sequentially with washing buffer I (50 mM Tris-HCl, pH 7.5, 1 
M NaCl, 1% NP-40, 1% sodium deoxycholate, 2 mM VRC) and 
washing buffer II (50 mM Tris-HCl, pH 7.5, 1 M NaCl, 1% NP- 
40, 1% sodium deoxycholate, 2 mM VRC, 1 M urea) for multiple 
times. The immunoprecipitated complex was eluted from Dyna- 
beads G by adding 140 ^il elution buffer (100 mM Tris-HCl, pH 7.0, 
5 mM EDTA, 10 mM DTT, 1% SDS). 5 ^il of 10 mg/ml proteinase 
K was then added to the retrieved RNA samples and incubated at 
55 °C for 30 min, followed by RNA extraction and qPCR. 

Chromatin immunoprecipitation (ChIP) 

1 X lo' cultured cells with each treatment were washed with 
ice-cold PBS, crosslinked with 1% formaldehyde and quenched 
by 0.125 M glycine. After being resuspended with 1 ml ChIP ly- 
sis buffer (1% Triton X-100, 0.1% sodium deoxycholate, 50 mM 
Tris, pH 8.0, 150 mM NaCl, 5 mM EDTA), cells were sonicated 
to achieve the majority of DNA fragments with 200-500 bp. Su- 
pernatants were collected and pre-cleared with Dynabeads G in 
Chip lysis buffer with the supplement of 100 |ig BSA and 100 |ig 
ssDNA. Then, the pre-cleared cell lysates were used for ChIP with 
2 )ig CTCF antibody (Millipore) for overnight incubation at 4 °C. 
The beads were then washed with 600 |il ChIP lysis buffer, 600 jil 
high salt wash buffer (1% Triton X-100, 0.1% deoxycholate, 50 
mM Tris, pH 8.0, 500 mM NaCl, 5 mM EDTA), 600 ^il of LiCl 
immune complex wash buffer (0.25 M LiCl, 0.5% Igepal, 0.5% 
deoxycholate, 10 mM Tris, pH 8.0, 1 mM EDTA), followed by 
two washes with 600 ^il Ix TE Buffer (10 mM Tris, pH 8.0, 1 mM 
EDTA) at 4 °C. The immunoprecipitated complex was eluted from 
Dynabeads G by adding 200 )\.\ fresh-prepared elution buffer (1% 
SDS, 0.1 M NaHCOj) with rotation at room temperature (RT) for 
15 min. Then the reverse crosslinking was carried out by adding 
8 |il of 5 M NaCl and incubating at 65 °C for 4 h, followed by 
incubating with the supplement of 4 |il of 0.5 M EDTA and 10 |il 
proteinase K (10 mg/ml) at 55 °C for 2 h. DNA was recovered by 
phenol/chloroform extraction and ethanol precipitation, followed 
by qPCR with the primers listed in Supplementary information. 
Table S2. 

Biotin-labeled RNA pull-down assay 

Biotin-labeled RNAs pull-down assay was perfomed as de- 
scribed [33, 57, 58] with minor modifications. Biotin-labeled 
CCATl-L truncation probes were in vitro transcribed with the 
Biotin RNA Labeling Mix (Roche) and AmpliScripe T7/SP6-flash 
Transcription Kit (Epicentre). 4 ng biotinylated RNA was dena- 
tured for 5 min at 65 °C in PA buffer (10 mM Tris-HCl, pH 7.5, 
10 mM MgCU, 100 mM NH^Cl) and slowly cooled down to RT. 2 
X 10' HT29 cell pellet was used for each assay. Briefly, HT29 cell 
nuclei [33] were resuspended in 1 ml RIP buffer (25 mM Tris-HCl, 
pH 7.5, 150 mM KCl, 0.5 mM DTT, 0.5% NP-40, 1 mM PMSF, 2 
mM VRC, protease inhibitor cocktail). The nuclei were sonicated 
on ice followed by centrifugation at 13 000 rpm for 10 min at 4 °C. 
The supernatant was transferred to a new tube and pre-cleared by 
applying 40 ]\\ Streptavidin Dynabeads for 20 min at 4 °C. Then 
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20 |ig/ml yeast tRNA was added to block nonspecific binding and 
incubated for 20 min at 4 °C. Folded RNAs were then added and 
incubated for 1 .5 h at RT, followed by the addition of 40 \i\ Strep- 
tavidin Dynabeads to incubate for 1 .5 h. Beads were washed with 
RIP buffer containing 0.5% sodium deoxycholate and denatured 
in 1 X SDS loading buffer The retrieved proteins were analyzed by 
western blot with anti-CTCF (Millipore) and anti-TCF4 (Millipore) 
antibodies. 

Nuclear run-on (NRO) 

The NRO assay in HT29 cells was perfomied as described pre- 
viously [33] with minor modifications. In brief, HT29 cells were 
washed with cold PBS quickly and collected for NRO assay 24 h 
post nucleofection with ASOs. Collected cells were incubated in 
swelling buffer (10 mM Tris, pH 7.5, 2 mM MgCl^, 3 mM CaCl^) 
on ice for 5 min and were then collected by centrifugation. Cell 
pellets were subjected to lysis twice with 1.5 ml lysis buffer (10 
mM Tris, pH 7.5, 2mM MgCl,, 3 mM CaCl^, 0.5% Igepal, 10% 
glycerol, and 2 U/ml RNasin Ribonuclease Inhibitor) to obtain 
purer nuclei. The resulting nuclear pellets were resuspended in 100 
III NRO buffer (50 mM Tris, pH 7.5, 5 mM MgCl,, 150 mM KCl, 
0.1% sarkosyl, 2 U/ml RNase inhibitor and 10 mM DTT) contain- 
ing 0.1 mM ATP, GTP, CTP and BrUTP (Sigma). Transcription 
was performed for 3 min on ice and then 5 min at RT. The reaction 
was stopped by addition of 600 (il Trizol to extract RNA, followed 
by the DNase I (Ambion, DNA-free TM kit) treatment to remove 
genomic DNA. Purified RNAs were incubated with 2 jig anti- 
BrdU antibody (Sigma) or equal amount of IgG antibody (Sigma) 
at 4 °C for 2 h and were then immunoprecipitated with Dynabeads 
G pre-coated with yeast tRNA (Sigma). Precipitated RNAs were 
extracted by Trizol and were used for cDNA synthesis and qPCR 
analysis. 

Chromosome conformation capture (3C) 

3C was performed as described [59] with modifications. HT29 
cells and HT29 cells under different treatments were crosslinked 
with 1% formaldehyde for 10 min at RT, followed by quenching 
the crosslinking with addition of 0.125 M glycine. The nuclei were 
collected and dounced in lysis buffer, followed by adding 400 U 
of Hindlll restriction enzyme for overnight digestion at 37 °C with 
shaking. Ligation was performed for 4 h at 16 °C followed by 1 h 
incubation with 50 U of T4 DNA ligase at RT. Reverse crosslink- 
ing was performed in the presence of proteinase K overnight at 65 
°C. The genomic DNA was then extracted by phenol-chlorofomi. 
After RNase A treatment, the genomic DNA was qualified with 
an input primer to balance the input chromatin across different 
samples. After normalization with input, equal amount of chroma- 
tin DNA was used in each PCR reaction to identify the chromatin 
loops and to compare the alterations between the targeted chroma- 
tin loops under different treatments. All 3C primers were designed 
according to a previous report [60] and listed in Supplementary 
information. Table S2. The PCR products with expected sizes 
were Sanger sequenced to ensure that a specific product is exactly 
the sequence of the ligation event. The 3C control chromatin loop 
(|3-globin) was also examined in the same way as described above. 

3 C Sequencing 

The 3C-Seq libraries were prepared step by step as described 
by Stadhouders R et al. [41]. Hindlll and Dpnll were used as 



the primary and secondary restriction endonuclease. The 3C-Seq 
library was sequenced (70 bp reads) with Illumina HiSeq 2000. 
The 3C sequencing signals were normalized with both the total 
sequencing reads and the highest frequency signal nearby the bait 
primer. The relative signals were shown in a "CC4ri-to-MFC" 
viewpoint in Supplementary infomation. Figure S6. 

In cis overexpression of CCATl-L or EGFP with TALEN 

All TALENs were designed and assembled according to litera- 
ture [61]. The TALEN assembly kit was obtained from Addgene. 
The target sequence of CCATl-L TALEN is TCATCATTACCAG- 
CTGCCGT and TTTCTGTGAATCGTGAGCGT. The homolo- 
gous arm sequences for donor plasmids are chr8: 128231306- 
128232194 and chr8: 128230503-128231297. After insertion of the 
homologous anus amplified from the genomic DNA of HCT116 
cells into pCRII vector (backbone for Donor plasmid construc- 
tion), the different regulatory modules {CMVIPurolegfp and the 
BGHISV40 poly(A) sites) were amplified from commercially 
available expression vectors including pEGFP-Cl and pLKO.l, 
followed by the insertion into the donor plasmid by overlapping 
PCRs to obtain desired TALEN vectors (Figure 3C and Supple- 
mentary information. Figure S4A). All plasmids were validated by 
Sanger sequencing. After transfection of individual sets of TALEN 
vectors and donor plasmids into HCT116 cells, puromycin was 
added to facilitate positive single clone screening. Genomic DNAs 
of selected single clones were extracted for the genotyping valida- 
tion with appropriate sets of primers listed in Supplementary infor- 
mation. Table S2. The positive clones were scaled up for banking 
and detailed experiments (Figures 3, Supplementary information, 
Figures S4 and S5). 

Tumor xenograft assay 

Female nude mice between 4 and 6 weeks were obtained from 
SLAC laboratory animal company and bred in SPF animal house. 
All animal work was done in accordance with a protocol approved 
by the Shanghai Experimental Animal Center (Chinese Academy 
of Sciences). After 1 week feeding after reaching the animal house, 
mice were inoculated subcutaneously with 3-4 x 10*' indicated 
cells of individual TALEN-engineered cell lines. Mice were main- 
tained in SPF animal house and were sacrificed for tumor weight 
analyses when the xenografts reached about 1.5 cm in diameter (at 
4 weeks). 

Accession numbers 

Raw sequencing dataset and bigWig track file of paired human 
CRC/control mucosa samples and 3C-Seq datasets are available 
for download from NCBI Gene Expression Ormiibus under acces- 
sion numbers GSE55259 and GSE55261. 
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